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Abstract 

We relate here two formalisms that are used for 
different purposes in reasoning about multi-agent sys- 
tems. One of them are strategic games that are used 
to capture the idea that agents interact with each other 
while pursuing their own interest. The other are CP- 
nets that were introduced to express qualitative and 
conditional preferences of the users and which aim at 
facilitating the process of preference elicitation. 

To relate these two formalisms we introduce a nat- 
ural, qualitative, extension of the notion of a strate- 
gic game. We show then that the optimal outcomes 
of a CP-net are exactly the Nash equilibria of an ap- 
propriately defined strategic game in the above sense. 
This allows us to use the techniques of game theory to 
search for optimal outcomes of CP-nets and vice-versa, 
to use techniques developed for CP-nets to search for 
Nash equilibria of the considered games. 



1 Introduction 

One of the main tools in the area of multi-agent sys- 
tems is game theory, notably strategic games. They 
formalize in a simple and powerful way the idea that 
agents interact with each other while pursuing their 
own interest. The interaction is captured by the 
fact that actions (strategies) are taken simultaneously, 
while agents' interests are expressed by means of the 
utility (payoff) function that each agent wishes to 
maximize. 

CP-nets (Conditional Preference nets) are an ele- 
gant formalism for representing conditional and qual- 
itative preferences, see PIE]. They model such pref- 
erences under a ceteris paribus (that is, 'all else being 
equal') assumption. The CP-net represents a complex 
'joint preference distribution' in a compact form. Pref- 
erence elicitation in such a framework appears to be 
natural and intuitive. 



Research on CP-nets focused on its modeling ca- 
pabilities and algorithms for solving various natural 
problems related to their use. Also, computational 
complexity of these problems was extensively studied. 
An outcome of a CP-net is an assignment of values to 
its variables. One of the fundamental problems is that 
of finding an optimal outcome, i.e., the one that can- 
not be improved in presence of the adopted preference 
statements. This is in general a complex problem since 
it was found that finding optimal outcomes and testing 
for their existence is NP-hard in general. In contrast, 
for so-called acyclic CP-nets this is an easy problem 
which can be solved by a linear time algorithm. 

The aim of this paper to show the relationship 
between CP-nets and game theory, and explain how 
game-theoretic techniques developed for the analysis 
of strategic games can be fruitfully used to study CP- 
nets. To this end, we introduce a generalization of 
the customary strategic games (see, e.g., [3],) in which 
each player has to his disposal a strict preference re- 
lation on his set of strategies, parametrized by a joint 
strategy of his opponents. We call such games strategic 
games with parametrized preferences. 

The cornerstone of our approach are two results 
closely relating CP-nets to such games. They show 
that the optimal outcomes of a CP-net are exactly the 
Nash equilibria of an appropriately defined strategic 
game with parametrized preferences. This allows us 
to transfer techniques of game theory to CP-nets. 

To find Nash equilibria in strategic games, reduc- 
tion techniques have been studied which reduce the 
game by eliminating some players' strategies, thus ob- 
taining a smaller game. We introduce two counter- 
parts of such game-theoretic techniques that allow us 
to reduce a CP-net while maintaining its optimal out- 
comes. We also introduce a method of simplifying a 
CP-net by eliminating so-called redundant variables 
from the variables parent sets. Both techniques sim- 
plify the search for optimal outcomes of a CP-net. 

The paper is organized as follows. Section |21 pro- 



vides the basic definitions of CP-nets. Then, Section 

introduces our generalized notion of games, Section 

01 shows how to pass from a CP-net to a game, and 
Section [5] handles the opposite direction. Then, Sec- 
tion HH introduces the concept of reduced CP-nets, and 
Section0shows how to exploit techniques developed in 
games for CP-nets. Finally, Section |H1 summarizes the 
main contributions of the paper and discusses current 
and future work. 

2 CP-nets 

CP-nets E] (for Conditional Preference nets) are 
a graphical model for compactly representing condi- 
tional and qualitative preference relations. They ex- 
ploit conditional preferential independence by decom- 
posing an agent's preferences via the ceteris paribus 
(cp) assumption. Informally, CP-nets are sets of ce- 
teris paribus (cp) preference statements. For in- 
stance, the statement "I prefer red wine to white wine 
if meat is served." asserts that, given two meals that 
differ only in the kind of wine served and both con- 
taining meat, the meal with a red wine is preferable 
to the meal with a white wine. On the other hand, 
this statement does not order two meals with a differ- 
ent main course. Many users' preferences appear to 
be of this type. 

CP-nets bear some similarity to Bayesian networks. 
Both utilize directed graphs where each node stands 
for a domain variable, and assume a set of features 
(variables) F = {X\, . . . , X n } with the corresponding 
finite domains T>{X\), . . . ,T>(X n ). For each feature Xi, 
a user specifies a (possibly empty) set of parent fea- 
tures Pa(Xi) that can affect her preferences over the 
values of X,-. This defines a dependency graph in 
which each node Xi has Pa{Xi) as its immediate pre- 
decessors. 

Given this structural information, the user explic- 
itly specifies her preference over the values of Xi for 
each complete assignment on Pa(Xi). This preference 
is assumed to take the form of a linear ordering over 
D{Xi) Each such specification is called below 

a preference statement for the variable Xi. These 
conditional preferences over the values of Xi are cap- 
tured by a conditional preference table which is anno- 
tated with the node Xi in the CP-net. An outcome 
is an assignment of values to the variables with each 
value taken from the corresponding domain. 

As an example, consider a CP-net whose features 
are A, B, C and D, with binary domains containing 
/ and / if F is the name of the feature, and with the 
following preference statements: 



d : a y a, d : a y a, 

a : b yb, a :b y b, 

b : eye, b :c y c, 

c : d y d, c : d y d. 

Here the preference statement d : a y a states that 
A = a is preferred to A = a, given that D = d. 
From the structure of these preference statements 
we see that Pa{A) = {D},Pa(B) = {A},Pa(C) = 
{£>}, Pa(D) = {C} so the dependency graph is cyclic. 

An acyclic CP-net is one in which the dependency 
graph is acyclic. As an example, consider a CP-net 
whose features and domains are as above and with 
the following preference statements: 

a y a, 

byb, 

(a A b) V (a A b) : c y c, (a A 6) V (a A b) : c y c, 

c : d y d. c : d y d. 

Here, the preference statement a y a represents the 
unconditional preference for A = a over A = a. Also 
each preference statement for the variable C is a ac- 
tually an abbreviated version of two preference state- 
ments. In this example we have Pa(A) = 0, Pa(B) = 
0, Pa(C) = {A, B}, Pa(D) = {C}. 

The semantics of CP-nets depends on the notion of 
a worsening flip. A worsening flip is a transition 
between two outcomes that consists of a change in the 
value of a single variable to one which is less preferred 
in the unique preference statement for that variable. 
By analogy we define an improving flip. For exam- 
ple, in the acyclic CP-net above, passing from abed to 
abed is a worsening flip since c is better than c given 
a and b. We say that an outcome a is better than 
the outcome /3 (or, equivalently, j3 is worse than a), 
written as a y [3, iff there is a chain of worsening flips 
from ot to j3. This definition induces a strict preorder 
over the outcomes. In the above acyclic CP-net the 
outcome abed is worse than abed. 

An optimal outcome is one for which no better 
outcome exists. In general, a CP-nct does not need to 
have an optimal outcome. As an example consider two 
features A and B with the respective domains {a, a} 
and {&, 6} and the following preference statements: 

a : b yb, a :b y b, 

b : a y a, b : a y a. 

It is easy to see that then 

ab y ab yab yab y ab. 



3 Strategic games with parametrized 
preferences 

In this section we introduce a generalization of the 
notion of a strategic game used in game theory, see, 
e.g., 0. 

First we need the concept of a preference on a set 
A which in this paper denotes a strict linear ordering 
on A. If ^ is a preference, we denote by y the corre- 
sponding weak preference defined by: a y b iff a y b 
or a = b. 

Given a sequence of non-empty sets S%, . . .,S n and 
s G Si x . . . x S n we denote the ith element of s by Sj 
and use the following standard notation of game the- 
ory, where I := ii, ■ ■ .,ik is a subsequence of 1, . . ., n: 

• S — ^ . (si , . . Si — x j • * •) Sn ) : 

• si := (s n ,. . .,s ik ), 

• (s'^s-i) := (si,...,s i -i,s' i ,Si + i,...,s n ), where 
we assume that G Si, 

• S-i :=5i x ... x Si-i x S l+ i x ... x S n , 

• Si := S h x ... x S ifc . 

In game theory it is customary to study strategic 
games in which the outcomes are numerical values pro- 
vided by means of the payoff functions. A notable ex- 
ception is |Sj in which instead of payoff functions the 
linear quasi-orderings on the sets of joint strategies are 
used. 

In our setup we adopt a different approach accord- 
ing to which each player has to his disposal a strict 
preference relation >- (s_,) on his set of strategies 
parametrized by a joint strategy S-i of his opponents. 
So in our approach 

• for each i G [l..n] player i has a finite, non-empty, 
set Si of strategies available to him, 

• for each i G [l..n] and s-i G S—i player i has a 
preference relation K s -») on his set of strategies 
St. 

In what follows such a strategic game with 
parametrized preferences (in short a game with 
parametrized preferences, or just a game) for n 
players is represented by a sequence 

(Si, ■ . ., S n , y(s-i), . . ., y(s- n )), 

where each s-i ranges over S-i. 

It is straightforward to transfer to the case of games 
with parametrized preferences the basic notions con- 
cerning strategic games. In particular, given a game 



G with parametrized preferences specified as above we 
say that a joint strategy s is a (pure) Nash equilib- 
rium of G if for all i G [l..n] and all G Si 

s t y(s- t ) 4. 

(For the original definition see, e.g., [S].) 

To clarify this definitions consider the classical Pris- 
oner's dilemma strategic game represented by the fol- 
lowing bimatrix representing the payoffs to both play- 
ers: 



3,3 


0,4 


4,0 


1,1 



So each player i has two strategies, Ci (cooperate) and 
Ni (not cooperate), the payoff to player 1 for the joint 
strategy (Ci,^) is 0, etc. To represent this game 
as a game with parametrized preferences we simply 
stipulate that 

y(C 2 ) := NiyCi, y(N 2 ) -.= NiyCi, 
y(Ci) := N 2 y C 2 , y(Ni) := N 2 y G 2 . 
These orderings reflect the fact that for each strategy 
of the opponent each player considers his 'not cooper- 
ate' strategy better than his 'cooperate' strategy. It is 
easy to check that (Ni,N<z) is a unique Nash equilib- 
rium of this game with parametrized preferences. 

4 From CP-nets to strategic games 

Consider now a CP-net with the set of variables 
{Xi,...,X n } with the corresponding finite domains 
T>(Xi), . . ., T>(X n ). We write each preference state- 
ment for the variable Xi as Xj = a\ : y^, where for 
the subsequence / = i\, . . ., ik of 1, . . ., n: 

. Pa(X i ) = {X il ,...,X ik }, 

• Xi = a i is an abbreviation for X^ = 
a n A ... A X ik = a lk , 

• >~i is a preference over D(Xi). 

We also abbreviate V(X n ) x . . . x V(X lk ) to V(Xi). 

By definition, the preference statements for a vari- 
able Xi are exactly all statements of the form Xi = 
ai : y(ai), where aj ranges over T>(Xi) and y(ai) is 
a preference on V(Xi) that depends on a/. 

We now associate with each CP-net N a game Q(N) 
with parametrized preferences as follows: 

• each variable Xj corresponds to a player i, 

• the strategics of player i are the elements of the 
domain T>(Xi) of Xi. 



To define the parametrized preferences, consider a 
player i. Suppose Pa(Xi) = {Xi 1 , . . ., X ik } and let 
I := ii, . . ., if.. So I is a subsequence of 1, . . ., i — 1, i + 
1, . . ., n. Given a joint strategy a_i of the opponents 
of player i, we associate with it the preference relation 
y(ai) on T>(Xi) where Xj = a/ : >~(ai) is the unique 
preference statement for Xi determined by aj. 

In words, the preference of a player i over his strate- 
gies, given the strategies chosen by its opponents, say 
a—i, coincides with the preference given by the CP- 
net over the domain of Xi given the assignment to his 
parents aj which must coincide with the projection of 
a_i over /. This completes the definition of G(N). 

As an example consider the first CP-net of Section 
|5J The corresponding game has four players A, B, 
C, D, each with two strategies indicated with /, / 
for player F. The preference of each player on his 
strategics will depend only on the strategies chosen by 
the players which correspond to his parents in the CP- 
net. Consider for example player B. His preference 
over his strategies b and b, given the joint strategy of 
his opponents s_b = dac, is b y b. Notice that, for 
example, the same ordering holds for the opponents 
joint strategy s^g = dac, since the strategy chosen 
by the only player corresponding to his parent, A, has 
not changed. 

We have then the following result. 

Theorem 1 An outcome of a CP-net N is optimal iff 
it is a Nash equilibrium of the game Q(N). 

5 Prom strategic games to CP-nets 

We now associate with each game G with 
parametrized preferences a CP-net Af(G) as follows: 

• each player i corresponds to a variable Xj, 

• the domain T>(Xj) of the variable Xi consists of 
the set of strategies of player i, 

• we stipulate that Pa(Xi) = 
{Xi,Xi_x,...,X i+1 ,...,X n }, where n is the 
number of players in G. 

Next, for each joint strategy s_j of the opponents 
of player i we take the preference statement A"_; = 
S-i : y(s-i), where y(s-i) is the preference relation 
on the set of strategies of player i associated with s_,;. 

This completes the definition of Af(G). As an ex- 
ample of this construction let us return to the Pris- 
oner's dilemma game with parametrized preferences 
from Section [3] In the corresponding CP-net we have 



then two variables X\ and X2 corresponding to play- 
ers 1 and 2, with the respective domains {Ci,Ni} 
and {C2,A2j-. To explain how each parametrized 
preference translates to a preference statement take 
for example y{Ci) := Ni y C%. It translates to 
X 2 = C 2 : iVi >- Ci. 

We have now the following counterpart of Theorem^ 

Theorem 2 A joint strategy is a Nash equilibrium of 
the game G iff it is an optimal outcome of the CP-net 
Af(G). 

6 Reduced CP-nets 

The disadvantage of the above construction of the 
CP-net Af(G) from a game G is that it always produces 
a CP-net in which all sets of parent features are of size 
n — 1 where n is the number of features of the CP-net. 
This can be rectified by reducing each set of parent 
features to a minimal one as follows. 

Given a CP-net N, consider a variable Xi with the 
parents Pa(Xi), and take a variable Y £ Pa(Xi). Sup- 
pose that for all assignments a to Pa(X) — {Y} and 
any two values 2/1,1/2 £ "D(Y), the ordcrings y(a,yi) 
and >-(a, 2/2) on V(Xi) coincide. 

We say then that Y is redundant in the set of par- 
ents of Xi. It is easy to see that by removing all re- 
dundant variables from the set of parents of Xi and 
by modifying the corresponding preference statements 
for Xi accordingly, the strict preorder y over the out- 
comes of the CP-nets is not changed. 

Given a CP-net, if for all its variable Xi the set 
Pa(Xi) does not contain any redundant variable, we 
say that the CP-net is reduced. 

By iterating the above construction every CP-net 
can be transformed to a reduced CP-net. As an ex- 
ample consider a CP-net with three features, X, Y 
and Z, with the respective domains {ai, 02}, {61, 62} 
and {01,02}. Suppose now that Pa{X) = Pa(Y) = 
®,Pa(Z) = {X,Y} and that 

y(ax,b%) = y(a 2 ,b 1 ), y(a 1 ,b 2 ) = y(a 2 ,b 2 ), 

y(ai,bi) = y(ai,b 2 ), y(a 2 ,b 1 ) = y(a 2 ,b 2 ). 
Then both X and Y are redundant, so we can re- 
duce the CP-net by reducing Pa{Z) to 0. Z becomes 
an independent variable in the reduced CP-net with 
an ordering over its domain which coincides with the 
unique one given in the original CP-net in terms of the 
assignments to its parents. 

In what follows for a CP-net N we denote by r(N) 
the corresponding reduced CP-net. The following re- 
sult summarizes the relevant properties of r(N) and 
relates it to the constructions of G(N) and Af(G). 



Theorem 3 

(i) Each CP-net JV and its reduced form N' = r(N) 
have the same ordering y over the outcomes. 

(ii) For each CP-net N and its reduced form N' = 
r(N) we have Q(N) = G(N'). 

(Hi) Each reduced CP-net N is a reduced CP-net cor- 
responding to the game Q(N). Formally: JV = 
r(M(Q(N))). 

Part (i) states that the reduction procedure pre- 
serves the ordering over the outcomes. Part (ii) states 
that the construction of a game corresponding to a CP- 
net does not depend on the redundancy of the given 
CP-net. Finally, part (Hi) states that the reduced CP- 
net JV can be obtained 'back' from the game G(N). 

Games G such that the CP-net JV' = r(Af(G)) is 
acyclic are not uncommon. In fact, they naturally 
represent multi-agent scenarios where agents (that is, 
players of the game) can be partitioned into levels 
1, 2, . . . , n, such that agents at level i can express their 
preferences (that is, payoff function) without looking 
at what players at higher levels do. Informally, agents 
at level i are more important than agents at level j 
is j > i. In particular, agents at level 1 can decide 
their preferences without looking at the behavior of 
any other agent. 

7 Game-theoretic techniques in CP- 
nets 

Given the correspondence between CP-nets and 
games and its properties presented in the previous sec- 
tions, we can now use them to transfer standard tech- 
niques of game theory, used to find Nash equilibria, to 
CP-nets to find their optimal outcomes. 

More specifically, we can transfer two techniques of 
iterated elimination of 'suboptimal' strategies — those 
that are strictly dominated or are never best responses 
(see, e.g., UJ.) To introduce them in the context of 
CP-nets consider a CP-net JV with the set of variables 
{Xi, . . ., X n } with the corresponding finite domains 
V(X 1 ),...,V(X n ). 

• We say that an element di from the domain T>(Xi) 
of the variable Xi is a best response to a pref- 
erence statement 

Xi = ai : y. 



• We say that an element di from the domain of the 
variable Xi is a never a best response if it is 
not a best response to any preference statement 
for Xi. 

• Given two elements di, d\ from the domain T>(Xj) 
of the variable Xi we say that d\ is strictly 
dominated by di if for all preference statements 
Xi = a i : y^ for Xi we have 

di >-i d'i. 

By a subnet of a CP-net JV we mean a CP-net ob- 
tained from JV by removing some elements from some 
variable domains followed by the removal of all pref- 
erence statements that refer to a removed element. 

Then we introduce the following relation between a 
CP-net JV and its subnet JV': 

N ¥ NBR N' 

when JV ^ JV' and for each variable Xi each removed 
element from the domain of Xi is never a best response 
in JV, and introduce an analogous relation JV — >g JV' 
for the case of strictly dominated elements. 
The following result then holds. 

Theorem 4 Suppose that N -^^ BR N' , i.e., the CP- 
net JV' is obtained by an iterated elimination of never 
best responses from the CP-net JV. 

( i ) Then s is an optimal outcome of N iff it is an 
optimal outcome of N' . 

(ii) If each variable in N' has a singleton domain, 
then the resulting outcome is a unique optimal 
outcome of JV. 

(Hi) All iterated eliminations of never best responses 
from the CP-net JV yield the same final outcome. 

To illustrate the use of this theorem reconsider the 
first CP-net from Section[2 i.e., the one with the pref- 
erence statements 

d : a >-a, d : a ya, 

a : b y b, a :b y b, 

b : eye, b :c y c, 

c : d y d, c : d y d. 
Denote it by JV. 

We can reason about it using the iterated elimina- 
tion of strictly dominated strategies (which coincides 
here with the iterated elimination of never best re- 
sponses, since each domain has exactly two elements). 

We have the following chain of reductions: 

JV ^s^i -^ S N 2 ^ S N 3 ^ S JV 4 , 



for X, if d, y, for all d\ G V(X{). 



where 



• Ni results from N by removing a (from the do- 
main of A) and the preference statements d : a y 
a, d : a >~ a, a : b >- b, 

• N 2 results from N± by removing b and the prefer- 
ence statements a : b >~ b, b :c>- c, 

• N 3 results from 7V 2 by removing c and the prefer- 
ence statements b : c >- c c : d >- d, 

• N4 results from 7V3 by removing d from the do- 
main of D and the preference statement c : d >~ d. 

Indeed, in each step the removed element is strictly 
dominated in the considered CP-net. So using the it- 
erated elimination of strictly dominated elements we 
reduced the original CP-net to one in which each vari- 
able has a singleton domain and consequently found a 
unique optimal outcome of the original CP-net N. 

Finally, the following result shows that the intro- 
duced reduction relation on CP-nets is complete for 
acyclic CP-nets. 

Theorem 5 For each acyclic CP-net N a unique sub- 
net N with the singleton domains exists such that 
N^nbrN'- 

8 Conclusions and future work 

In this paper we related two formalisms that are 
commonly used in reasoning about multi-agent sys- 
tems, strategic games and CP-nets. To this end we 
generalized the concept of strategic games to games 
with parametrized preferences and showed that opti- 
mal outcomes in CP-nets are exactly Nash equilibria of 
such games. This allowed us to exploit game-theoretic 
techniques in search for optimal outcomes of CP-nets. 

Our current research deals two other aspects con- 
cerning strategic games and preferences. First, thanks 
to the established correspondence, we can also use the 
techniques developed to reason about optimal out- 
comes of a CP-net in search for Nash equilibria of 
strategic games with parametrized preferences. These 
techniques, as recently shown in JHEjj mvorve the use 
of the customary constraint solving techniques. In 
fact, it has been shown that the optimal outcomes of 
any CP-net, even a cyclic one, can be found by just 
solving a set of hard constraints. Thus hard constraint 
solving is enough to find also Nash equilibria in strate- 
gic games. 

Second, we found that the direct correpondence be- 
tween the optimal solutions of a CP-net and the Nash 
equilibria of the corresponding game cannot be easily 



found in other preference modelling formalisms, for ex- 
ample soft constraints, see pp. In fact, it is possible 
to show that, in a so-called fuzzy constraint problem, 
there can be optimal solutions which are not Nash 
equilibria of corresponding games, and vice-versa. We 
are therefore studying the conditions under which soft 
constraints can be related to game theory 

In this paper we assumed that payoff functions give 
a linear order over the strategies of a player. It could 
be useful to see whether our results can be generalized 
to games in which players' strategies can be incompa- 
rable or indifferent to each other, thus using partial 
ordcrings with ties. We arc currently studying this 
scenario. 

This paper is just a first step towards what we 
think is a fruitful cross-fertilization between prefer- 
ences, constraint solving, and game theory. 
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